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ABSTRACT 

The phases of the Fourier modes appearing in a plane-wave expansion of cosmologi- 
cal density fields play a vital role in determining the morphology of gravitationally- 
developed clustering. We demonstrate this qualitatively and quantitatively using sim- 
ulations. In particular, we use cross-correlation and rank-correlation techniques to 
quantify the agreement between a simulated distribution and phase-only reconstruc- 
tions. The phase-only reconstructions exhibit a high degree of correlation with the 
original distributions, showing how meaningful spatial reconstruction of cosmological 
density fields depends more on phase accuracy than on amplitudes. 
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1 INTRODUCTION 

The standard theory of the origin of the large-scale structure 
of the universe involves the assumption that the structure we 
see today grew by gravitational instability from initial small 
fluctuations in the density field. In most popular variants of 
this model, particularly those involving cosmic inflation, the 
initial fluctuations are of a particularly simple form known as 
a Gaussian random fleld. Gaussian random flelds are useful 
because many properties of Gaussian random density flelds 
can be calculated analytically (e.g. Bardeen et al. 1986). 
Some direct motivation for such an assumption emerges from 
inflationary models, wherein the fluctuations are generated 
by quantum oscillations of the scalar fleld driving inflation 
(the "inflation"). Even if inflation turns out to be incorrect, 
however, the Central Limit Theorem tends to produce Gaus- 
sian fluctuations in any linear process, so that they are the 
most generic form for small initial conditions and a natural 
default assumption. 

One particularly interesting property of Gaussian ran- 
dom fields is that the requirement for the density contrast 
(5(x) = [p(x) — po]/po to be a Gaussian random fleld is equiv- 
alent to that the real and imaginary parts of its Fourier 
components Sk, where 



5(k) = i5(k)|exp(i0J, 



(2) 



5(x) = ^ ^ (5(k) exp(ik ■ x). 



(1) 



are independently distributed. In other words, the Fourier 
modes S(k), 
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possess phases 0k which are independently distributed and 
uniformly random on the interval [0,27r]. As the density 
fleld is simply a sum over a large number of Fourier modes 
and if the phases of each of Fourier modes are random, the 
Central Limit Theorem guarantees that the resulting su- 
perposition of the one-point probability distribution 'P{S) is 
close to Gaussian and that all of the fleld's joint probabil- 
ity distributions are multivariate Gaussians. The statistical 
properties of an isotropic Gaussian random fleld are then 
completely specifled by its second-order statistical quantity: 
the covariance function, or alternatively, its power spectrum 
P(fc) = {5'{k)). 

In the framework of gravitational instability, the growth 
of fluctuations can be understood analytically when the den- 
sity fluctuation amplitude is small compared to the mean 
density; the linear perturbation theory tells us that each 
Fourier mode grows with the same rate independent of 
wavenumber and the statistical distribution of S remains 
constant except its variance. 

The linear theory breaks down when (J^) is compara- 
ble with unity or beyond, and different Fourier modes start 
coupling. One way to look at the mode coupling is that S 
is always constrained to value 5 > —1. When the perturba- 
tion is small, the tail of P(<5) in the part of (5 < —1 assigned 
by Gaussian distribution is negligible, because probability of 
5 < — 1 is small. When the density fleld evolves beyond the 
linear regime, i.e., = {S^) ~ 1, a long tail at high 3 is gen- 
erated while lower bound is conflned at 5 = —1. Gaussian 
condition is therefore invalid, in that mode coupling effect 
causes the initial condition to skew. Terms in the evolution 
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Figure 1. Visual demonstration of the importance of phase in- 
formation for clustering morphology. Plate (a), (c) and (e) have 
the same phase configuration, so do plate (b) and (d). Plate (a), 
(d) and (f), however, share the same power spectrum, or alterna- 
tively, two-point correlation function, so do (b) and (c). See the 
text for details. 

equations for the Fourier modes that represent couphng be- 
tween different modes are of second (or higher) order in 5 
and these are neglected when first-order perturbation theory 
is considered. Phases of Fourier modes therefore are there- 
fore coupled together in a way which is yet to be fully elu- 
cidated but which has been recently investigated by Chiang 
& Coles (2000) and Coles & Chiang (2000). 

One of the reasons for studying Fourier phases in depth 
is the question of possible primordial non-Gaussianity of 
the initial density distribution of the Universe. For exam- 
ple, there have been claims of non-Gaussianity from anal- 
ysis results in the COBE DMR sky maps of cosmic mi- 
crowave background using a number of different diagnos- 
tics: bispectrum analysis (Ferreira et al. 1998); the wavelet 
transform (Hobson et al. 1999); and Minkowski function- 
als (Schmalzing & Gorski 1998). The most direct practical 
approach, however, is through the distribution of Fourier 
phases. This diagnostic can also avoid the subtlety of more 
standard methods that depend on the distribution function 
7'((5)(Scherrer et al. 1991), in that a density field with Gaus- 
sian single-point density distribution 1^(5) is not necessarily 
a Gaussian field. Phase information is also important as a 
statistical diagnostic of non-linearity when the evolution of 
clustering in the non-linear regime which is generally in- 
tractable analytically. Indeed, , as we shall show, it is also 
closely related to the morphology of gravitational clustering 
and through this to the dynamical origin of structure. 

The layout of this paper is as follows. In section 2 we 
present a visual demonstration of the link between Fourier 
phases and clustering morphology. Section 3 contains a brief 
discussion of some of the properties of phases and some 
pitfalls that must be avoided when using them as part of 
a clustering descriptor. In Section 4 we introduce a cross- 
correlation parameter S and a rank-correlation r as quanti- 
tative measures of the agreement between two distributions. 
In Section 5 we display the results of the correlation tests 
between sample distributions and phase-based reconstruc- 
tions, and between simulations evolving from different ini- 
tial power spectra but the same initial phase set. A brief 
discussion of the results follows in Section 6. 



2 VISUAL DEMONSTRATION 

To give a qualitative, visual description of the key ideas 
in this paper consider Fig. 0, in which we isolate the role 
of phases in determining clustering morphology. Plates (a) 
and (b) are two example realisations from two 2D N-body 
experiments, evolving from different initial power-law power 
spectra and different initial phase sets. Plate (a) is evolved 
from power spectral index n = —1, and (b) from n — 1. 
We perform a Fourier transform on both realisations, e.g., 
S^ix.) = X;5kexp(ik • x), where <5^ = exp(i(^^). Plate 
(c) is obtained by taking the inverse Fourier transform from 
the combination of the phases of Fourier modes from (a). 



and the amplitudes from (b), i.e., .7^~^[l(5kj exp{i(f)^)]. Plate 
(d) is from the phases from (b) but amplitudes from (a). 
Therefore, plate (a) and (c) share the same phase configura- 
tion, so do (b) and (d). It is easy to see resemblance between 
(a) and (c), and between (b) and (d). Note that plate (a) 
and (d) have the same power spectrum, or equivalently, two- 
point correlation function, as do plate (b) and (c). Plate (e) 
is the inverse Fourier transform from only the phases from 
(a): [exp{i(j>^)]; it therefore retains only the phase in- 
formation from (a). In plate (f), each mode keeps the same 
amplitude so its power spectrum is unchanged (i.e. the same 
as plate (a) and (d)) but the phases are redistributed ran- 
domly among the modes before inverse Fourier transform. 
Again, plate (e), the phase-only reconstruction (hereafter 
phase-only reconstruction), resembles the original distribu- 
tion plate (a), and plate (c), the same-phase amplitude- 
swapped reconstruction (hereafter amplitude-swapped re- 
construction), but (f) which has the same power spectrum 
as (a) but random phases (hereafter random-phase recon- 
struction), is featureless. This experiment suffices to show 
very clearly how phases determine morphology. 

Another interesting property of Fourier phases in clus- 
tering morphology is that two realisations evolving from the 
same initial random phase set, though different power-law 
power spectra, will have their extrema at the same locations. 
The fact that Plate (a) and (b) are evolved from different 
initial phase sets is for demonstration purpose. 



3 QUANTIFYING PHASE SHIFTS 

Fourier phases reflect the locations of spatial 'events' more 
than Fourier amplitudes do. For a single hypothetical spike, 
represented by the Dirac 5-function Sn{x — xo), the am- 
plitudes are constant and phases are kxo. This has a white- 
noise spectrum, but very strong phase correlation. The phase 
configuration of a spike is very similar to that of a single 
density peak evolved from ID Zel'dovich approximation; see 
Chiang & Coles (2000) for details. A translation x of the 
Dirac 5- function density field, Sd{x — xq — x'), has no ef- 
fect on the Fourier amplitudes, but the phases now become 
k{xo + x'), which suffers a shift by a linear term proportional 
to wave number k. This dependency of phases on the choice 
of origin means that some care must be taken when trying 
to extract meaningful information. Some previous studies fo- 
cused on the evolution of individual phases away from their 
initial values (Ryden & Gramann 1991; Soda & Suto 1992; 
Jain & Bertschinger 1998). The mean deviation from the 
initial phase can be defined as 

Acb{k,a) = {|A0(k,a)|) = MKa) - cb{k,a,)\}, (3) 

where the averages are performed over the different modes 
within a shell in k-space whose wave numbers lie in the range 
k — 0.5 < jkj < fc + 0.5. As long as there are enough modes, 
the maximal value of A(fi{k,a) is 27r/3 (Jain & Bertschinger 
1998), which can be understood as a variable obtained from 
the change from a random field to a hypothetical spike, i.e., 
from random phases to a very ordered state, (f){k,a) = kxo{ 
mod 2n), where xo is the location of the spike and takes on 
any values except zero. This statistic will change following 
the translation of 'events', e.g., A(f>{k,a) = tt if the hypo- 
thetical spike is shifted to the origin. 
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4 CROSS AND RANK CORRELATIONS 

Although the examples displayed in Section 2 serve to 
demonstrate the importance of phases, it is necessary to 
measure objectively how well the phase-based reconstruc- 
tions compare not only with the original distribution, but 
also with reconstruction from, say, random phases. The tool 
we use here is the cross-correlation statistic introduced by 
Coles, Melott & Shandarin (1993). A correlation coefflcient 
is defined as 



!)l/2 



(4) 



where 5ij and Slj represent two density distribution and S 
and 5^ are their mean densities, respectively. The indices i 
and j label the pixel positions in the two-dimensional sim- 
ulations we use here for illustration. The parameter S com- 
pares the density value of each grid point at in the 
original distribution S with the corresponding grid point of 
the reconstruction S^; averages are taken over all grid points. 
Sss^ ~ 1 denotes 'completely positive correlation', a perfect 
agreement between 5 and 5^ , and a value of zero indicates 
the two distributions are uncorrelated. What is useful about 
this test is that it compares the morphology between two dis- 
tributions, point by point, but does not take into account 
their variances. This is because Sss^ = 1 when S = C5' , C 
being a constant. If the structures of two distributions are 
different, l^l is less than unity. 

Cross-correlation tests the spatial correspondence of the 
locations of 'events', such as clusters of points or edges, be- 
tween two distributions. Even the comparison by a small dis- 
placement between two identical periodic distributions will 
result in low value of S. It should be pointed out, therefore, 
that this grid-by-grid test is severe. As we have explained, 
phases are related to preservation of locations of 'events'. 
Relative magnitudes of the 'events', however, are not pre- 
served when the information of Fourier amplitudes is par- 
tially or totally lost. To relax the test, comparisons are made 
not only between raw density distributions S and S'^ , but 
also between smoothed distributions 5(x, ii) and 5^{x,R). 
For this purpose, a Gaussian window function is chosen to 
smooth the field. 



<5(x,i?) 



j2 

a X 



5(x ) (V27ri?)" exp(- 



(5) 



As well as the linear cross-correlation S, we also use 
a non-parametric rank-correlation, Kendall's t parame- 
ter(Kendall and Gibbons 1990). This parameter uses the 
relative ordering of ranks to measure the degree of agree- 
ment between two compared distributions. For any two grid 
points (j,j) and (i ,j ), the given value is -1-1 if the rela- 
tive ordering of the density values of S{i,j) and S{i , j ) is 
the same as S^{i,j) and S^{i ,j ), i.e., either both decreasing 
or both increasing from {i,j) to (i ,j ); —1 when one pair 
is increasing and the other is decreasing. The r parameter 
compares N{N — l)/2 pairs from total grid points and 
is normalized to [—1, 1]. Rank correlation measures such as 
this do not look specifically for linear association between 
the image and reconstruction, but for one-to-one ordering of 
the values in one relative to those in the other. If there is 
strong non-linear association, then r can be close to unity 
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Figure 2. Plots of cross-correlation coefficiont S and rank- 
correlation coefficient r against smoothing scale R (in com- 
puter grid units) for the comparison between sample distributions 
and its phase-based reconstructions. Phase-only and amplitude- 
swapped reconstructions retain the morphology of the original 
structure of the sample realisation, which is shown by a high 
degree of cross-correlation. Random-phase reconstruction, how- 
ever, bears no resemblance to the original structure, hence low S. 
Rank-correlation compares the relative magnitudes of the density 
peaks. Due to the loss of the information on the Fourier ampli- 
tudes in the reconstructions, the relative ordering of the ranks 
of the density magnitudes is partially disturbed. The phase-only 
reconstructions keep the ordering better than amplitude-swapped 
reconstructions. 



even though S may be small. Kendall's r parameter is used 
here as an auxiliary test, which is again "softened" with a 
Gaussian window function as in eq. (H). 



5 RESULTS 

Fig. ^ shows cross-correlation coefficient S and rank- 
correlation coefficient r drawn against smoothing scale R 
in computer grid units. Each panel includes correlations be- 
tween the sample distribution and phase-only, amplitude- 
swapped reconstructions. Correlations of random-phase re- 
constructions are calculated only in S. On the top-left panel 
of Fig. the sample distribution evolved from spectral index 
n — —1 (Fig. la) is compared with phase-only reconstruc- 
tion(Fig.le), and with amplitude-swapped reconstruction 
(Fig.lc). The random-phase reconstruction (Fig. If) is also 
compared for reference. The sample distribution of lower- left 
panel is evolved from n = 1 (Fig. lb). Even before smooth- 
ing, the higher degree of cross-correlation from phase-only 
and amplitude-swapped reconstructions than from random- 
phase reconstruction shows the ability of phases to retain the 
morphology of the sample distributions. Non-resemblance of 
random-phase reconstructions is shown through low value 
of 15*1. That smoothing of the reconstructions increases (for 
amplitude-swapped it even decreases) certain level of 5* in- 
dicates the limitation of phases on reconstruction with the 
loss of amplitude information. 
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Figure 3. Comparison of morpliology of two different stages in 
N-body simulations in terms of cross-correlation coefficient S be- 
tween realisations evolving from different initial power-law power 
spectra but the same initial phase set. The plots are drawn against 
smoothing scale R, as in Fig. (H). 



The rank-correlations, on the other hand, show that 
phase-only reconstructions have higher agreement than 
amplitude-swapped ones on relative density magnitudes, 
which is due to the 'twist' on the latter's power spectra. 
Surely, the loss of the Fourier amplitude information de- 
stroys part of the signal, as shown in the rank-correlation of 
the right panels. On the morphology of gravitational cluster- 
ing, however, phases play a much more important role than 
amplitudes. 

It emphasizes this point still further to test the corre- 
spondence between simulations evolving from different ini- 
tial power spectra but the same initial phases. In Fig. 
we compare the morphology for some relevant examples in 
terms of cross-correlation coefficient S deployed above. In 
particular, we show realisations obtained by evolving A'^- 
body experiments from different 2D initial power-law power 
spectra n = —1, 1 and 2. The left panel is the com- 
parison of stage c in which the scale of non-linearity is 
kNL = 64fc/, and the right panel is that of later stage / 
with kNL = 8fc/(Chiang & Coles 2000, Beacom et al. 1991). 
We use the fundamental mode kf = 27r/L as length unit, 
where L is the length of the side of the simulation square. 

Before smoothing, there is little or no correspondence 
between any of the realisations. This is due to the severe 
point-by-point test of the cross-correlation. After smoothing, 
however, there is a dramatic improvement between in the 
correspondence between n — —1,0, n = 0, 1, and between 
n — 1,2. We can examine the morphology evolving from the 
same phase set by the characteristic scale fcivz,. For example, 
kNL for stage c corresponds to 8 computer grid units, thus 
the smoothing on scales beyond 8 grid units erases non- 
linearities, while the larger scale structure remains in the 
linear regime. The linear growth of the density fluctuations 
set up by the same initial phases depends only on time. 
The comparison therefore indicates the intrinsic difference 
in clustering morphology arising from different initial power 
spectra. What we see on the scales beyond kNL is that the 
correspondence is low between n = —1 and n = 1, 2. 

The correspondence deteriorates when the difference be- 
tween the spectral indices increases, i.e., the difference in in- 
trinsic morphology is significant, or when the evolution goes 
into highly non-linear regime, where particles move away 
from their initial Lagrangian position and interact with each 
other non-linearly. Simulations evolving from the same ini- 
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Figure 4. Scatter diagrams of cell density between smoothed 
original distributions and reconstructions. The smoothing scale 
is two grid units to produce a continuous distribution. In each 
panel only one in four grid points in each spatial direction on 
the 512^ grid are sampled: 16384 points and a straight line with 
slope equal to unity is added for reference. The top-left panel is 
a comparison between Fig. 1(a) and Fig. 1(e) , the bottom- left is 
between Fig. 1(a) and (c), and the bottom-right is between (b) 
and (d). 



tial phase configuration tend to develop nonlinear structures 
at or near the same spatial locations, but these structures 
appear with different contrast when the initial spectra are 
different. For example, filaments appear in the nonlinear 
regime in all cases, but for spectra with large n these tend to 
be less well defined and broken up into clumps. Our statis- 
tic S takes into account both the position and amplitude 
of structures that form so it indicates a deteriorating agree- 
ment for very different spectra. Nevertheless, it is clear that 
there is strong imprint upon the morphology of the initial 
phases resulting from the process of gravitational cluster- 
ing. This can also be seen visually in the pictures shown by 
Beacom et al. (1991). 

The cross-correlation coefficient being a single number, 
so one cannot infer from it precisely how the phase-based 
reconstructions perform relative to the morphology of the 
original distribution. One intriguing question is how the den- 
sity regions from reconstructions can be compared to those 
from the original distributions. Is it 5 oc 5^ that produces 
high value of 5*? From rank-correlation coefficient r, this is 
is not the case. In order to get more information between the 
two distributions, we also made a grid-by-grid comparison 
between them. A scatter diagram can be drawn in logarith- 
mic scales for the comparison between each cell density from 
phase-based reconstructions against the corresponding one 
from the original distribution. If 5 oc 5^ , the points scatter 
along a straight line of slope equal to unity, and the relative 
magnitude of 'events' is preserved by a linear scale factor. 
Fig. ^ shows the scatter diagram between original distribu- 
tion and phase-only, amplitude-swapped reconstructions for 
n — 1 and n = — 1. Not all the points scatter along the 



© 0000 RAS, MNRAS 000, 000-000 



The Importance of Fourier Phases for the Morphology of Gravitational Clustering 5 



Figure 5. Scatter diagrams of cell density between simulations 
evolving from the same phase set. Here we choose between spec- 
tral index n = —I and n = 0, and between n = —1 and n = 2. 
The smoothing scale is four grid units. As in Fig. ^ 16384 points 
are sampled and a straight line with slope equal to unity is added 
for reference. 



straight line, as is suggested. Instead, most points scatter 
horizontally, particularly for phase-only cases. Also notice 
also that, in both reconstruction cases, the density values 
span only roughly one order of magnitude and the points 
that do not align well correspond to very small fluctuations. 
High values of S result from phase configuration preserving 
the locations of high-density 'events' with large magnitudes 
in the original structure. The relatively fiat phase-only re- 
constructed density distribution is caused by the fiat power 
spectrum, in which the Fourier amplitude for each mode is 
squashed into unity. The lower two panels are scatter dia- 
grams of the amplitude-swapped reconstructions against the 
originals, i.e., in Fig. |l| between (c) and (a) and between (d) 
and (b). The difference is amplitude-swapped reconstruc- 
tions have the power spectra from the alternative sample 
distributions. The n — —1 amplitude-swapped reconstruc- 
tion has the power spectrum from the original distribution 
of n = 1, which gives more power on small scales in the re- 
construction, the n = 1 reconstruction, on the other hand, 
has more power on large scales. This adverse effect causes 
the points of the low fluctuations to spread on the scatter 
diagram, which, however, doesn't decrease much the cross- 
correlation coefficient S between phase-only and amplitude- 
swapped reconstructions. This is in accord with with our vi- 
sual impression that our eyes pick up the maxima between 
the distributions for comparison. 

In Fig. 1^ the scatter diagrams for the realisations evolv- 
ing from the same phase set are also produced. Here only two 
cases are chosen, i.e., between n = — 1 and 0, and between 
n = — 1 and 2. The left two panels are comparison between 
n — —1 and n = 0, and the scattering is expected, which 
nonetheless follows the straight line. For the right panels, at 
early stage c, the deviation from the straight line is more 
isotropic, hence the correspondence is low. At late stage /, 
there is even more deviation, which comes from the map- 
ping from the low-density regions of stage f,n — 2. The 
reason is as follows. For the realisation of n = 2 to reach 
the same level of non-linearity, the variance is higher than 
that of n = — 1, and there are a substantial number of voids. 
After smoothing to create continuous distribution, the void 
regions are smoothed as low-density regions, which is seen 
in the bottom-right panel. 



6 CONCLUSION 

We have shown the importance of Fourier phases on mor- 
phology of gravitational clustering via qualitative and quan- 
titative demonstrations. It is interesting to remark upon the 
similarity of the results we have obtained here and those 
presented in Coles et al. (1993). The latter authors were in- 
terested in the ability of simple analytic methods to repro- 
duce the clustering displayed by full N-body computations. 



They showed in particular that the Zel'dovich approxima- 
tion (Zel'dovich 1970) could reproduce the full numerical 
results quite well, with cross-correlations similar to those 
we have found here. The Zel'dovich approximation works as 
well as it does in this respect because it places the caus- 
tic surfaces forming sheets and filaments near to the cor- 
rect location in the N-body experiment. In other words, the 
Zel'dovich approximation has a high phase fidelity. It is less 
good at getting the amplitudes right. As we have shown, 
however, phases dominate the morphology. 

With a new colour representation technique (Coles & 
Chiang 2000), phase information can be used to distinguish 
between non-Gaussianity induced by gravitational cluster- 
ing and that by other mechanisms. For example, in the web 
page 

\protect\vrule widthOpt\protect\href {http : //www .nottingham. ac .uk\i 

the phase configuration of the non-Gaussian temperature 
fluctuations on the CMB sky induced by cosmic strings can 
be seen to be intrinsically different from that of hierarchi- 
cal clustering developed in N-body simulations. A new al- 
gorithm based on the analysis of the phase distribution of 
Fourier components is recently devised to extract noise from 
point sources from the CMB signal(Naselsky et al. 2000), 
which is another example practice of the close link between 
Fourier phases and morphology. 

Future large-scale galaxy redshift surveys and mi- 
crowave sky maps will reveal much morphological informa- 
tion about large-scale structure in the Universe. Existing 
statistical technology, however, is still dominated by second- 
order methods that are blind to phase information. Our suc- 
cess in extracting this information will therefore depend on 
the development of statistical methods sufficiently sensitive 
to the key ingredient: the distribution of Fourier phases. 
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